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A REPRESENTATIONAL APPROACH TO DNA ANALYSIS 

CROSS-REFERENCE TO REI ATED APPLICATIONS 
This application is a continuation-in-part of application 
serial no. 07/974,447, filed November 12, 1993. 

5 INTRODUCTION 
Technical Field 

The field of this invention is DNA analysis. 

Background 

Comparative genomic DNA analysis holds promise for the 
10 discovery of sequences which may provide for information 
concerning polymorphisms, infectious DNA based agents, lesions 
associated with disease, such as cancer, inherited dominant and 
recessive traits, and the like. By being able to detect 
particular DNA sequences which have a function or affect a 
15 function of cells, one can monitor pedigrees, so that in 
breeding animals one can follow the inheritance of particular 
sequences associated with desirable traits. In humans, there 
is substantial interest in forensic medicine, diagnostics and 
genotyping, and determining relationships between various 
20 individuals. There is, therefore, substantial interest in 
providing techniques which allow for the detection of common 
sequences between sources and sequences which differ between 
sources. 

The mammalian genome is extraordinarily large, having about 
2 5 6 x 10 9 bp. The human genome project has initiated an effort 
to map and sequence the entire genome. However, much of the 
early work will be directed more toward determining the site 



WO 94/11383 



PCT/US93/10722 



of particular genes, than determining contiguous sequences of 
a particular chromosome. 

Because of the complexity of the human genome, there is a 
very substantial handling and processing problem with the human 
5 genomic DNA. In order to deal with such a large amount of DNA, 
one must develop processes which allow for simplification and 
selection, while still providing the desired information. 
Therefore, efforts must be made which will provide for 
opportunities which will allow to greater or lesser degrees, 
10 dissecting portions of a genome of interest, where comparisons 
can be made between two different sources of DMA. 

Relevant Literature 

Efforts at difference analysis at the level of the genome are 
described by Lamar and Palmer, Cell 37, 171 (1984); KunJcel 
15 et al., Proc. Natl. Acad. Sci. USA 82, 4778 (1985); Nussbaum 
et al., Proc. Natl. Acad. Sci. USA 84, 6521 (1987); Wieland 
et al., Proc. Natl. Acad. Sci. USA 87, 2720 (1990); Straus and 
Ausubel, Proc. Natl. Acad. Sci. USA 87, 1889 (1990). 

SUMMARY OF THE INVENTION 

2 0 Representational difference analysis is provided to determine 

similarities or differences between two related sources of DNA. 
In a first st*p, a representative portion of each genome is 
prepared, using a restriction endonuclease (RE1) , ligation of 
partially double-stranded adaptors, and the polymerase chain 
25 reaction, and cleavage with RE1 to provide a population of 
relatively small DNA fragments referred to as "amp) icons. " 
This stage may be repeated in separate analyses with different 
restriction endonucleases or different schemes, e.g., 
fractionation. 

3 0 The first amplicon of source DNA is referred to as the 

"driver," which amplicon is used in substantial excess in the 
subsequent processing of the other, "tester" amplicon. The 
tester includes the "target" DNA, which DNA is absent in or is 
present in reduced amounts in driver amplicon. Partially 
3 5 double-stranded PCR adaptors are ligated only to tester 
amplicon fragments, and the tester and driver DNA combined, 
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melted and reannealed. The termini of the amplicons are filled 
in and using primers complementary to the adaptors, the DNA 
mixture is subjected to amplification, wherein the target DNA 
will undergo exponential amplification and be substantially 
5 enriched as compared to driver DNA and non- target tester DNA, 
which anneals to the driver DNA. Adaptors may then be removed 
and the cycle repeated using different adaptors. Various 
modifications may be employed at different stages to further 
enhance selection of the target DNA, 



10 BRIEF DES CRIPTION OF THE DRAWINGS 

Fig. l is a gel electrophoresis and genomic blot analysis of 
the application of RDA to isolate probes that detect gene 
amplification; 

Fog. 2 is a gel electrophoresis analysis of gene 
15 amplification using drivers from different sources; 

Fig. 3 is a sequence comparison of difference product P35 
from human prostate cancer with rat retrotransposon RatLlRnB6 ; 
and 

Fig. 4 is a gel electrophoresis analysis of difference 
2 0 sequences between two cDNA populations. 



DESCRIPTION OF THE SPE CIFIC EMRQn^ FMTg 
Methods are provided for representational difference analysis 
("RDA") between two sources of DNA. The method permits the 
detection of sequences which differ between the tw- sources, 

25 where under selective conditions of hybridization, DNA from one 
of the two sources is not significantly hybridized to DNA from 
the other source. Sources include genomes, sets of DNA 
fragments, usually > 0.2 kbp, collections of restriction 
endonuclease-cleaved fragments, cDNA or cDNA libraries, etc. 

3 0 The method involves a first step, referred to as 
representation, and then two or more further steps referred to 
as subtractive and kinetic enrichment, which may be repeated 
in order to provide for substantial enrichment of the sequences 
of interest. 

3 5 For the purpose of this invention, a number of coined terms 
will be used. "Driver" DNA is DNA from a source which will be 
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used to determine the presence of DMA in a second source, the 
"tester" source. Those fragments that are unique or in higher 
concentration to the tester DNA, as compared to the driver DNA, 
will be referred to as "target 19 DNA. The DNA sequences are 
5 obtained in a first stage resulting from restriction 
endonuclease digestion, followed by linkage of adaptors and 
then amplification with primers complementary to the adaptors. 
The resulting DNAs are referred to as " ampl icons. " The 
amplicons will be characterized by being under about 2 kb and 

10 usually at least about 0.5 kb, where the teraini will normally 
have the same restriction endonuclease recognition sequence 
prior to linkage to the adaptors. 

The subject application may find use in a wide variety of 
situations. In determining the presence or absence of 

15 particular DNA sequences , particularly associated with 
recessive or dominant traits, one can compare two related 
sources of DNA to determine whether they share the particular 
sequence, where the sequence may be a coding or non-coding 
sequence, but will be inherited in association with the DNA 

20 sequence (s) associated with the trait. One can use the subject 
method in forensic medicine, to establish similarities between 
the DNA from two sources, where one is interested in the degree 
of relationship between the two sources. The subject method 
can also be applied in the stuuy of diseases, where one can 

2 5 investigate the nT-esence of a sequence associated with 

infection, such as a viral sequence which may or may not be 
integrated into the genome. One may also use the subject 
methodology in studying changes in the genome as a result of 
cancer, whsre cancerous cells may be compared to normal wild- 
30 type cells. Thus, the subject methodology has application for 
detecting genetic rearrangements, genetic loss, gene or other 
DNA amplification, for identification of DNA from pathogenic 
organisms integrated into the genome or present in the cellular 
host, for identification of polymorphisms located at or near 

3 5 genes associated with inherited disorders, for identification 

of genes which are expressed in a particular cellular host, 
identification of lesions in neoplastic cells, and the like. 
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In carrying out the subject method, there are concerns which 
should be considered when applying the subject method. The PCR 
may be a source of artefact, due to the stochastic nature of 
the process. Therefore, each candidate difference product 
5 should be tested for its presence or absence in tester and 
driver amplicons. Another source of artefact may occur during 
tissue sampling. Normal flora contaminating a specimen of 
tester will be readily enriched during difference analysis if 
that flora is not also present in driver. Genetic mosaicism 
10 may be encountered. In situations where one is dealing with 
polyclonal tissue, such as in a cancer biopsy, there must be 
a minimum proportion of cells which has the particular mutation 
in order to be able to detect the presence of the mutation. 
Therefore, it would be desirable to use cultures of cancer 
15 cells or highly purified cancer cells obtained by physical 
separation as the source for the tester DNA. In the case of 
discovery of pathogens, there should be a careful matching of 
the polymorphisms from the infected and uninfected DNA source. 
In the latter case, tester and/or driver DNA may derive from 
20 the same individual, come from an identical twin, come from 
separate but related individuals, be the pooled DNA from the 
parents of the tested individual, be pooled DNA from related 
sources, e.g. cell strains, common genetic dysfunction, or 
comropn trait, or the like. 
25 Finally, not all restriction endonucleases will be equivalent 
in the ease with which target DNA may be identified. 
Therefore, in each case it will be desirable to use a plurality 
of restriction endonucleases in separate determinatiwns, not 
only to ensure that one obtains target DNA within a reasonable 
30 number of cycles, but also to increase the number of target DNA 
sequences that may be obtained. 

Turning now to the specific process, the first stage is the 
isolation of DNA. As already indicated, the DNA may be from 
any source, eukaryotic or prokaryotic, invertebrate or 
35 vertebrate, mammalian or non-mammalian, plant or other higher 
eukaryotic source. ***While, from the standpoint of direct 
application to human interests, the sources will be human DNA, 
the subject methodology is applicable to any complex genome, 
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where one is interested in identifying the presence or absence 
of related DNA, such as laboratory animals, plants, domestic 
animals, or in any other situation where an inbred or outbred 
population is of interest. Normally, the DNAs will be from 
5 closely-related sources, so that the number of target DNA 
seguences which are obtained will be relatively restricted in 
number, frequently being fewer than about 10 4 , usually fewer 
than about 10 3 , different sequences. While genomic DNA will 
usually be the source of driver and tester DNA, cDNA may also 
10 be used, where one is interested in the differences between two 
cDNA populations from two different mJRNA sources.*** 

In the first stage, the DNA is isolated, freed of protein, 
and then substantially completely digested with a restriction 
endonuclease which provides for relatively infrequent cutting. 
15 Usually, the restriction endonuclease will have a consensus 
sequence of at least six nucleotides and may provide for. blunt 
ends or staggered ends, usually staggered ends. Various 
restriction endonucleases may be employed, such as BamHI, 
Saill, Hindlll, etc. After digestion of the DNA, double- 
20 stranded oligonucleotide adaptors are ligated to the ends of 
each of the strands of the DNA from the driver and the DNA from 
the tester. The adaptor will usually be staggered at both 
ends, with one strand being longer and serving as the sequence 
complementary to the primer. The adaptor will be double* 
25 stranded and have one end complementary to the ends of the 
dsDNA from the digestion. The DNA from the two sources is then 
separately amplified, by adding primer and using the polymerase 
chain reaction with extension for the last round, usually 
employing at least 10 cycles, more usually at least 15 cycles 
30 and generally not more than about 30 cycles, more usually not 
more than about 25 cycles and preferably about 20 cycles. 
After this number of cycles, for the most part, the fragments 
will be mainly less than about 2 kb, usually below about 
1.0 kb. The adaptors are then removed by restriction 
3 5 endonuclease digestion and physical separation, using any 
convenient means. 

As distinct from a physical fractionation, the amount of 
starting material is not limiting when using representation. 
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When employing amplicons of mammalian DNA after cleavage with 
BamH I. BqI II and Hin dlll, the estimated complexity of the 
resulting amplicons are 55-fold r 13-fold and 8-fold less than 
the complexity at the starting genomic DNA, respectively 
5 (Bishop et al. , Am. J. Hum. Genet. 35, 795 [1983]). 

Other methods of representing the genome to reduce its 
complexity may be employed. For example, cleavage with a more 
frequently cutting enzyme , e . g . a 4 nt consensus sequence 
restriction enzyme, followed by addition of adaptors, PCR 
10 amplification and size fractionation, will achieve this end. 
Another method might use oligonucleotides as primers to 
repetitive DNA in the genome to amplify a representational 
portion of the genome, flanking repetitive sequences. 

In the next phase, subtract ive and kinetic steps are employed 
15 in a single operation of hybridization and amplification. If 
desired, the steps may be separated, but will preferably be 
done contemporaneously. The first aspect of this stage is the 
ligation of PCR adaptors to the 5' ends of tester amplicon 
fragments or the products of previous rounds of enrichment, 
20 when the procedure is reiterated. Ligation to the 3' ends of 
tester amplicon is to be avoided, which can be achieved, for 
example, by using adaptors that are not phosphorylated at their 
5' ends. Usually, the adaptor chain complementary to tne primer 
will be at least about 12 nt, more usually at least 17 nt, and 
25 generally fewer than about 200 nt, more usually fewer than 
about 100 nt. Any convenient method for ligation of the 
adaptors to the 5' ends may be employed, as appropriate. 

The tester amplicon fragments joined to the adaptors are then 
combined with the driver amplicon fragments and melted and 
30 allowed to reanneal. The driver amplicon fragments will be 
present in substantial excess, usually at least 5-fold excess, 
and the excess may exceed 50 or more, usually not exceeding 
about 10 8 -fold excess, more usually not exceeding 500-fold 
excess. The ratio of driver DNA to tester DNA need not be 
35 constant for the different rounds. Usually, the ratio will 
increase with successive rounds where the increase may vary 
from about 1:1 to 10 3 . The initial ratio will generally be in 
the range of about 10 to 1000- fold excess. Conveniently, 
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melting will be achieved by heating at an elevated temperature, 
generally > 95°C and hybridization proceeding at about 60°C, 
where various buffers may be employed, as well as salt 
concentrations, to provide the necessary stringency. Usually, 
5 fairly high stringencies will be employed, generally at least 
about eguivalent to or greater than about 0.1 M NaCl, usually 
about 1 M NaCl. 

After melting and reannealing, there will be a substantial 
enrichment of target OKA in the total double-stranded DNA, 

10 since the target DNA will not be inhibited from self-annealing 
due to the lack or relative deficiency of complementary 
sequences present in the driver DNA. 

Overhangs are then filled in by employing any convenient DNA 
polymerase, e.g., Taq DNA polymerase, in the presence of the 

15 four nucleotides, whereby only double-stranded, self-reannealed 
tester DNA will have filled- in adaptors at each end of the 
amp 1 icon. Since the driver DNA does not inhibit target DNA 
from self-annealing, while the driver DNA inhibits non-target 
tester DNA from self -annealing, there is a substantial 

20 enrichment in the target DNA as compared to the total tester 
DNA. 

The double-stranded self-reannealed tester amplicon will then 
be amplified under conventional polymerase chain reaction 
conditions, usually involving at least about 5 cycles, 

25 frequently as many as 10 cycles and usually not more than about 
4 0 cycles, preferably not more than about 30 cycles. The 
amplification may be interrupted about midway and single- 
stranded DNA degraded using an appropriate nuclease. Various 
nucleases may be employed, particularly mung bean nuclease. 

3 0 The resulting double-stranded DNA mixture may then be 
digested with a restriction endonuclease which removes the 
adaptors from the tester DNA. The tester DNA may be separated 
from the adaptor sequence, using any convenient means which 
permits separation by size. Gel filtration or gel 

3 5 electrophoresis may be conveniently employed. The amplicons 
may then be ligated to a second set of adaptors, usually 
different from the first or previous set and the cycle of 
melting in the presence of excess driver amplicon, annealing, 
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filling in overhangs, and PCR amplification repeated. Later 
cycles may rely on the previous adaptors. In the subject 
process, this cycle may be repeated one or more times, there 
usually being at least 2 rounds or repetitions and not more 
5 than about 6 rounds, usually 2 to 4 rounds being sufficient. 
It will frequently be of interest to carry out the process 
more than once, where different restriction endonucleases are 
employed for each study. In this way, different ainplicons will 
be obtained and one may obtain different information. 
10 Depending upon the purpose for the process, two or more 
restriction endonucleases may be utilized in separate 
preparations of the ainplicons. One may also compare the probes 
obtained with different restriction endonucleases to determine 
if they overlap, bind to genomic DNA sequences which are 
15 proximal, are part of the same gene or polymorphic region, and 
the like. 

In carrying out the process, the first round is mainly 
subtract ive . Subsequent rounds have a greatly- increased 
component of kinetic enrichment. For example, if target DNA 
20 is equimolar with respect to tester DNA (i.e. a single copy), 
and if driver amplicon is taken in N-fold excess to tester 
amplicon, assuming virtually complete reannealing of driver 
amplicon, target will be enriched N times after the first 
round. After the second round, target will be enriched N* 

2 5 multiplied by a factor due to the subtractive component, and 

after the third time, at least the square of that. If N is 50, 
at the end of the second round, target will be enriched by 
about 10 4 , and at the end ox the third round, on the order of 
10 8 . In general a single cycle of subtraction can be expected 
30 to yield enrichments of target in the order of fN, where N is 
the molar excess of driver amplicon to tester amplicon and f 
is the fraction of driver amplicon that reanneals. 

The resulting target DNA or difference product may be further 
enriched for probes defining differences between the DNA 

3 5 sources. Conveniently, the sequences may be cloned and then 

screened using Southern blots or other technique for 
determining complementation against tester and driver 
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amplicons. Those clones which hybridize to tester amplicons 
and not driver amplicons may then be used further. 

The resulting target DNA may be used as probes to identify 
sites on the tester DNA genome which differ from the driver 
5 DNA. For this purpose, they may be labeled in a variety of 
ways, -such as with radioactive labels, biotin, fluorescers, 
etc* Desirably , in order to obtain substantially homogeneous 
compositions of each of the target amplicons , the target 
amplicons may be cloned by inserting into an appropriate 

10 cloning vector for cloning in a prokaryotic host. If desired, 
the cloned DNA may be sequenced to determine the nature of the 
target DNA. Alternatively, the cloned DNA may be labeled as 
described above, and used as probes to identify fragments in 
libraries carrying the target DNA. The target DNA may be used 

15 to identify the differences which may be present between the. 
two sources of DNA. 

Where a plurality of probes for target DNA are obtained, they 
may be referred to as putative probes until established as true 
probes. Conveniently, the sequences may be cloned and then 

20 screened using Southern blots or other technique for 
determining complementation against tester and driver 
amplicons. Thus, the group of probes may include hybridizing 
sequences which hybridize to both driver and tester DNA. One 
can quickly determine those putative probes which do not 

25 distinguish between driver and tester DNA by hybridizing, e.g. 
Southern hybridiziang, the probe to driver and tester 
amplicons. Where the putative probe binds to both driver and 
tester amplicons, the probe may be discarded. Those clones 
which hybridize to tester amplicons and not driver amplicons 

3 0 may then be used further. This screen is particularly useful 
where at least 5, more usually at least 10 putative probes are 
obtained. 

In pedigree analysis, the subject process may be used 
to define sequences which are present in one member of a family 
3 5 and not present in another. In this way, one may then compare 
other members of the family as to whether they carry the same 
DNA or it is absent. This may find use in forensic medicine, 
where there may be an interest in the relationship between two 



WO 94/11383 



PCT/US93/1072 



- 11 - 

individuals, a sample obtained from a source and an individual, 
or the like. 

The subject method can also be used to construct libraries 
of probes for genetic polymorphisms, which may be referred to 
5 as PARFs, which is operationally defined as a polymorphic 
restriction endonuclease fragment, present in the amplified DNA 
from one genome and not present in the amplified DNA from a 
different genome from a like organism. For example, if one of 
two BaffiHI sites flanking a shc~t BamHI fragment in tester DNA 
10 is absent in both alleles from driver DNA, leading to only 
large lamHl fragments in driver, the short BamHI fragment of 
tester will be present in its BamHI amplicon, but absent in the 
BaffiHI amplicon of the driver. Thus, the restriction fragment 
would directly lead to a probe which will distinguish between 
15 the two genomes. 

It should be appreciated, that where the amplicons are 
cloned, there may be substantial redundancy in individually- 
picked clones. Therefore, the efficiency of selecting 
different probes will vary substantially depending upon the 
20 frequency in which the amplicon was present in the mixture 
prior to cloning, which may be as a result of the varied 
efficiency of amplification, or other artefacts which are built 
into the methodology. 

The subject method can be used to isolate probes for 
25 pathogens, where DNA Which is suspected of being infected may 
be compared to DNA which is believed to be uninfected. For 
example, if one were interested in a virus which is tropic for 
a particul— cell type or tissue, e.g., HIV for T-cells and 
macrophages or hepatitis B virus for liver, one could take 
30 tissue from the source suspected of infection for which the 
virus is tropic and tissue from another site in the same 
individual, where such virus should not be present. By 
carrying out the process, one should obtain probes which would 
be specific for the virus, since by appropriate selection of 
3 5 the sources of the cells, one would not anticipate any other 
differences. 

A limitation of the subject process, which will be applicable 
to viruses, as well as other situations, is that the population 
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carrying the target DNA should be a reasonable proportion of 
the total number of cells from which the tester DNA is derived. 
As indicated above, where one is interested in the presence of 
integrated pathogenic DNA, it may be that only a small 
5 proportion of these cells in the tissue are infected. It may, 
therefore, be desirable to normalize the tester sequences, in 
order to equalize the concentrations of all tester sequences, 
prior to the subtractive and kinetic enrichment (Patanjali 
et al., Proc. Natl. Acad. Sci. USA 88, 1943 [1991]). 

10 Application of RDA to the discovery of pathogens desirably 
requires a careful matching of the polymorphisms from the 
infected and uninfected DNA sources. Tester and driver DNA can 
derive from the same individual, if the individual is not a 
genetic mosaic. These DNAs should not derive from unrelated 

15 individuals, as the abundant polymorphic differences in their 
DNAs would obscure the detection of the pathogen. However, the 
uninfected DNA source (driver) could, in principle, come from 
an identical twin, or be the pooled DNA from the parents of the 
infected individual, because virtually all of the DNA 

20 restriction fragments found in the genomic DNA of the infected 
individual can be expected to be present in at least one parent 
DNA. 

The subject methodology may also be applied to detecting 
genomic alterations occurring in cancer cells* These could be 

25 of three distinct types: those that result in loss of 
restriction endonuclease fragments, such as might occur from 
deletions or gene conversions extending over heterozygous 
polymorphisms; those that produce new restriction endonuclease 
fragments, such as might result from point mutations or genomic 

30 rearrangements; and those that result in the amplification of 
DNA, usually iricorportating a gene. In the second and third 
cases, RDA could be applied without modifications using DNA 
from cancer cells as tester and normal DNA as driver- However, 
the presence of normal stroma in a cancer biopsy could 

3 5 interfere with the detection of loss of genetic information in 
the cancer cell. Hence, either cultures of cancer cells or 
highly-purified cancer cells obtained by physical separation 
would be needed as the source for tester in the first case. 
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These restraints do not apply to the detection of genomic 
rearrangements. Genomic rearrangements, including 

translocations, insertions, inversions and deletions, will 
result in the creation of new restriction endonuclease 
5 fragments bridging the site of the rearrangement. Some of 
these bridging fragments may be amplifiable, while at least one 
of the fragments from which they derive in normal DNA is not. 
Such bridging fragments would be discoverable by RDA, when DNA 
from the tumor is used for preparation of tester amplicons and 
10 DNA from normal tissue of the same individual is used for 
preparation of driver amplicons. 

The different-sized restriction endonuclease fragments 
created by genomic rearrangements may be exploited another way. 
Fractionated size classes from tumor DNA digests will sometimes 
15 contain sequences that are not present in comparable-size 
classes from normal DNA. Using" the former as tester and the 
latter as driver, one can prepare amplicons after cleavage with 
a second restriction endonuclease and compare these by RDA in 
order to clone amplifiable restriction endonuclease fragments 
20 in proximity to the point of genetic rearrangement. With 
either of the above-indicated methods, the presence of normal 
cells among the tumor cells will not obscure the detection of 
probes for the rearrangement. 

In the final situation, DNA amplification, it appears that 

2 5 the detection of amplification is a result of kinetic 

enrichment during RDA. Being able to detect amplified 
sequences can find application in cancer prognosis, since it 
has been found that amplif iwation of oncogenes indicates a poor 
prognosis. 

30 When RDA is applied to different individuals, it will yield 
a collection of polymorphisms of a type, which has been 
previously referred to as PARFs. Thus, RDA can be used for 
generating new sets of polymorphisms, not only for species that 
have not previously undergone extensive molecular genetic 

3 5 characterization, but also for well-studied species as humans 

and mice. Since PARFs most often detect binary polymorphisms, 
they can serve as a panel of probes that can be used with a 
standardized format for genetic typing. 
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In yet another application, RDA can yield probes for PARFs 
present in the DNA of an individual from a founder group 
affected by some autosomal dominant inherited disorder (the 
tester) , but absent in the DNA of an individual from a normal 
5 group (the driver). Conversely, RDA can yield probes for PARFs 
present in the DNA of a normal individual (the tester), but 
absent in the DNA of an individual from the founder group 
affected by a recessive inherited disorder (the driver). 
Combined with methodologies for coincidence cloning (Brooks and 
10 Porteous, Nuc. Acid Res. 19, 2609 [1991]), such applications 
can accelerate the discovery of probes for rare PARFs in 
linkage disequilibrium with the dominant locus, or the absence 
of common PARFs in linxage disequilibrium with the recessive 
locus. 

15 In many laboratory animals and plants there are cqngenic 
strains, where a particular gene has been transferred from one 
genetic background onto another by successive generations of 
backcrossing. Such strains will be genetically identical 
except in a relatively small region surrounding the gene of 

20 interest. The region will be typically small enough to permit 
chromosomal walking to the target gene, but large enough for 
the needs of the subject methodology. 

The subject methodology may be applied to the discovery of 
polymorphisms that are genetically linked to an inherited trait 

2 5 such as a disease susceptibility or a behavorial abnormality. 

To utilize the subject methodology for this purpose, it is 
desirable to use pools of DNAs from a group of individual for 
use as either tester, driver or both. When used this way, the 
method may yield probes that detect polymorphic alleles that 

3 0 are present in one group and not in another. In particular, 

when such pools are used as driver, the probes obtained for 
restriction endonuclease polymorphisms ( M PARFs M ) that 
distinguish tester from all individuals in the driver pool. 
When pools are used as tester, the method yields PARFs that 
3 5 distinguish at least one member of the tester pool from the 
driver individual. In the most challenging example, when both 
tester and driver are pooled DNAs from groups of individuals, 
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the method yields PARFs that distinguish at least one member 
of the tester group from all members of the driver group. 

Pooling may be demonstrated in a variety of situations. One 
application uses transmission genetics to produce a collection 
of siblings with the property that their pooled DNA is 
homozygous in the region of a target gene but heterozygous 
elsewhere in the genome. As an illustration, if two inbred 
strains differ at a target locus L of interest, one strain A 
carries a recessive allele (a) and the other strain B carries 
a dominant allele (a+) , for tester one can use strain B, while 
for Driver, one performs an F2 intercross between the strains, 
selects k progeny showing the recessive phenotype, and mixes 
their DNA together. When employing the subject method, B 
alleles should be subtracted everywhere in the genome except 
15 in a region around L. 

The targetting of the method can be further improved where 
the locus L has been genetically mapped between two flanking 
genetic markers, X and Y. For the driver, one can select 1/2 
k progeny in which a crossover had occurred between X and L and 
1/2 k progeny in which a crossover had occurred between L and 
Y. this would guarantee that the proportion of B alleles is 
25 % at X and Y. This ensures that the region over which the 
proportion of B alleles is very low is restricted to the 
interval X -Y. 

The pools may be of various sizes depending on the source of 
DNA. From large genomes, such as mammalian and plant genomes, 
generally a pool as small as 8 different sources may be 
employed, usually io, and generally not more than SO, usually 
not more than about 20. 

Other applications may involve spontaneous germ line genomic 
rearrangements. The genome of such an infected individual will 
include restriction endonuclease fragments that are present in 
neither parent. This situation is analogous to genetic 
rearrangements occurring in cancer cells, which has been 
35 previously discussed. 

To ensure that the subject process has operated properly, it 
will normally be desirable to test candidate difference 
products (target DNA) for its presence or absence in tester and 
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driver amplicons. Also of concern will be the presence of 
flora, which may contaminate tester, but is not present in 
driver; Genetic mosaicism ™\ 11 also interfere with the subject 
methodology. However, in a wide variety of contexts, the 
5 subject method will efficiently provide sequences which can be 
used for analyzing differences between two genomes as a result 
of a wide variety of events. 

The following examples are offered by way of illustration and 
not by way of limitation. 

10 EXPERIMENTAL 

Preparation of Amplicons. 10 /ig of high molecular weight DNA 
purified from the lymphoid cell line DRL 484 (a gift of 
T. Caskey, Baylor College) was used for preparation of driver 
amplicons and 10 ug of the same DNA, containing eguimolar 

15 amounts of target (120 pg of adenovirus-2 DNA and/or 160 pg of 
X phage DNA, both from New England Biolabs) was taken for 
preparation of tester amplicons. Both tester and driver DNA 
samples were digested with restriction endonuclease (New 
England Biolabs) and 1 ag of each DNA digest was mixed with 
20 0.5 nmoles of 24-mer and of 12-mer unphosphorylated 
oligonucleotides (set 1, see Table 1) in 30 mL of T4 DNA ligase 
buffer (New England Biolabs) . 



Table l. Sequences of Primers Used for Representational 
25 Difference Analysis. 



30 



Primer 
Set 


Name 


Sequence 


1 


R Bgl 24 


5 ' -AGCACTCTCCAGCCTCTCACCGCA-3 ' 




R Bgll2 


5 ' -GATCTGCGGTGA-3 ' 


2 


J Bgl24 


5 ' -ACCGACGTCGACTATCCATGAACA- 3 ' 




J Bgll2 


5 ' -G ATCTGTTC ATG - 3 ' 


3 


N Bgl24 


5 ' - AGGCAACTGTGCTATCCG AGGG AA- 3 ' 




N Bgll2 


5 ' -GATCTTCCCTCG-3 ' 


1 


R Bam24 


5 ' -AGCACTCTCCAGCCTCTCACCGAG-3 ' 




R Baml2 


5 ' -GATCCTCGGTGA-3 ' 
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Primer 
Set 


Name 


1 

Sequence 


2 


J Bam24 


5 ' -ACCGACGTCGACTATCCATGAACG-3 ' 




J Baml2 


5 ' -GATCCGTTCATG-3 ' 


3 


N Bam24 


5 ' - AGGCAACTGTG CTATCCG AGGG AG - 3 ' 




N Baml2 


5 ' -GATCCTCCCTCG-3 ' 


1 


R Hind24 


Same as R Bgl24 (see above) 




R Hindl2 


5 ' -AGCTTGCGGTGA-3 ' 


2 


J Hind24 


Same as J Bgl2 4 (see above) 




J Hindl2 


5 ' -AGCTTGTTCATG-3 ' 


3 


N Hind24 


5 ' -AGGCAGCTGTGGTATCGAGGGAGA-3 ' 




N Hindl2 


5 ' -AGCTTCTCCCTC-3 ' 


1 


Seg24 


5 ' -CGACGTTGTAAAACGACGGCCAGT-3 




Rev25 


5 ' - C ACACAGG AAACAGCTATG ACC ATG - 3 ' | 



Primer set 1 (R series) is used for representations, and sets 
2 (J series) and 3 (N series) are used for odd and even 
hybridization/amplifications, respectively. Oligonucleotide 
10 design was checked for the absence of strong secondary 
structure using the OLIGO computer program (National 
Biosciences) • 

Oligonucleotides were annealed by cooling the mixture 

15 gradually froir 50°C to 10°C for one hour and then ligated to 
human DNA fragments by overnight incubation with 400 U of T4 
DNA ligase at 16 °C. Following ligation, both tester and driver 
DNA samples were amplified. Each of 10 tubes taken for 
preparation of driver amplicons and 2 tubes used for 

20 preparation of tester amplicons contained in a volume of 
400 Ml: 67 mM Tris-HCl, pH 8.8 at 25°C / 4 mM MgCl 2 , 16 mM 
(NH 4 ) 2 S0 4 , 10 mM 0-mercaptoethanol, 100 Mg/ml bovine serum 
albumin, 200 uM (each) dATP, dGTP, dCTP. and dTTP, 1 mM 24-mer 
primer and 80 ng of DNA with ligated adaptors. The tubes were 

25 incubated for 3 min. at 72°C in a thermal cycler (Perkin Elmer 
Cetus) , 15 U of Taq polymerase (AmpliTaq, Perkin Elmer Cetus) 
was added, the reactions were overlaid with mineral oil , 
incubated for 5 min. to fill in 5' protruding ends of ligated 
adaptors, and amplified for 20 cycles (each cycle including 

30 1 min. incubation af 95°C and 3 min. at 72°C, with the last 
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cycle followed by an extension at 12°C for 10 rain.)- After 
amplification both driver and tester aroplicons were digested 
with the sane restriction endonuclease (10 U/jig) to cleave away 
adaptors. io ug of tester amplicon ONA digest was 
5 electrophoresed through 2% NuSieve agarose (low melting point, 
FMC Bio Products), and DNA fragments (150-1500 bp) were 
recovered after melting of the agarose slice and Qiagen-tip20 
chromatography (Quiagen Inc.) to remove adaptors. These 
fragments were ligated to a new set of adaptors (primer set 2, 
10 see Table l) in preparation for the first round of 
hybridization and amplification. 

PNA Hybridization and Amplifica t ion st«p. 0 .5 Hq of the tester 
amplicon ligated to adaptors and 40 nq of driver amplicon DNA 
were mixed, ethanol precipitated, dissolved, in 4^1 of 3xEE 
15 buffer (Straus and Ausbel, Proc. Natl. Acad. Sci. USA 87, 1889 
[1990]) and overlaid with 30 M l of mineral oil (Perkin Elmer 
Cetus) . Following heat denaturation 1 Ml of 5 M NaCl solution 
was added and DNA was hybridized for 20 h at 67 »C. At the end 
of hybridization, l/ioth part of the resulting DNA was 
incubated with 15 U of Tag polymerase (5 min. , 72»C) in 400 M l 
of PCR mixture without primer to fill in ends of reannealed 
tester, and then amplified for 10 cycles (l min. at 95«c, 
3 min. at 70*0, followed by io min. extension for the last 
round) after addition of the same 24-mer oligonucleotide to 
which tester was ligated. single stranded DNA molecules 
present after amplification were degraded by 30 min. incubation 
with 20 u of mung bean nuclease (New England Biolabs) in a 
volume of 40 ^1 as recommended by the supplier followed by 
5-fold dilution of the sample in 50 mM Tris-HCl, pH 8.9 and 
heat inactivatibn of enzyme (95'C, 5 min.). 40 M l of the 
solution was amplified for 15-20 cycles under the same 
conditions as before the mung bean nuclease treatment. 
Amplified DNA (3-5 M g) was digested with the original 
restriction endonuclease and 200 ng of the digest was ligated 
to the third adaptor set (see Table 1) . 50-100 ng of this DNA 
was mixed with 40 ng of driver amplicon and the hybridization 
and amplification procedures were repeated as in the first 



20 



25 



WO 94/1 1383 



PCT/US93/10722 



- 19 - 

cycle. 200 ng of the digest obtained after the second 
hybridization/amplification step was then ligated to the second 
set of adaptors and 100-400 pg of this material together with 
4 0 Mg of driver amplicon was taken for the third round of 
5 hybridization, with the final amplification after mung bean 
nuclease digestion for 20-25 cycles. A fourth hybridization/ 
amplification step was performed after taking 5 pg of material 
from the third round ligated to adaptors of the third set and 
mixing it with 40 nq of driver amplicon. 

10 Exercple i. Representational Difference Analysis with Viral 
DNAs Add*'* as Targets. 
Single-copy levels of adenovirus and/or bacteriophage X OKA 
was added to human DNA to create a model tester, and used with 
the same human DNA without viral DNA as driver. Bal ll 

15 amplicons from human DNA with adenovirus and X DNAs as targets 
or gindlll amplicons with X DNA as target were prepared. With 
Bglll amplicons, small X and adenovirus fragments were the 
major difference products, even after two rounds, as evidenced 
by agarose gel electrophoresis. This represented an enrichment 

2 0 of > 5 x 10 6 -fold from the starting material and a probable 
enrichment of about 4 x io 5 -fold from amplicons. 

The enrichment from Kindlll amplicons was not as effective. 
The X Hindlll fragment was greatly enriched after the third 
round as evidenced by blot hybridization, but still not to 

2 5 homogeneity. After the fourth round the expected target 

fragment was purified to near homogeneity. The difference 
between the experience with the Hindlll restriction 
endonuclease and the fialll restriction endonuclease may be 
related to the greater sequence complexity of the Hindlll 

3 0 amplicons. When the complexity of the driver is too high, 

subtractive and kinetic enrichments are diminished and 
competing processes may dominate. The competing processes may 
involve the emergence of efficiently-amplified repetitive 
sequences in tester. 

3 5 Example 2. Representational Difference Analysis of DNAs from 
Two Individuals. 
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Driver and tester amplicons were prepared from human 
lymphoblastoid cell cultures GM05901 and GM05987, respectively 
(Amish Pedigree 884, Human Genetic Mutant Cell Repository, 
Camden, NJ) . Amplicons were prepared after cleavage with 
5 BamHI, fialll or flindlll. Difference products between amplicons 
were obtained as described above and size fractionated by gel 
electrophoresis. A discrete but complex pattern of bands was 
observed in each case. After three 

hybridizations/amplifications, difference products were cloned 
10 into plasmids. For each difference product, three prcbes were 
picked for blot hybridization analysis. It was found that all 
of them were polymorphic within the Amish family data. BamHI 
difference products were analyzed in greatest detail. 
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BamHI amplicons were prepared from DNA from seven Amish 
pedigree lymphoblastoid cell cultures, GM05901 (driver) , 
GM05987 (tester), GM05918, GM05961, GM05963, GM05993, GM05995 
(columns A-G) , five different placentas (columns H-L) , three 
5 lymphoblastoid cell lines established from the biopsies of 
leukemic patients (columns M, N, O) and two fibroblast cell 
cultures, DRL 484, and DRL 569 (a gift of T. Caskey, Baylor 
College) established from the biopsies of DMD patients 
(columns P, Q) , transferred to GeneScreen membrane, and 

10 hybridized to the indicated probes. indicates the percent 

of clones in a Bam HI PARF collection of difference products 
cloned after three hybridization-amplification steps that 
hybridized to the indicated clone . M +" means that the small 
BamHI PARF allele was present in the sample (i.e. the probe 

15 hybridized to a band of the correct size in the amplicon) ; 

means that the small allele was not detected. See Fig. 3C for 
a sample of the actual data. The lengths of the alleles 
hybridizing to PARFs are indicated, where known. M ND M means 
not determined. 

20 ta) Two different small alleles were found in the human 
population. 

(b) Two different large alleles were found in the human 
population. 



25 Of 20 randomly-picked clones, 12 unique clones remained after 
removing redundancies, and the inserts from 9 of these were 
used as probes in Southern blots of tester, driver and 5 other 
members of the family (GM05918, GM05987 [tester], GM05901 
[driver], GM05961, GM05963, GM05993, and GM05995 from Amish 

3 0 pedigree 884). All probes detected small BamH I fragments in 
the tester (Table 2, col. B) and only large BamHI fragments in 
the driver (Table 2, col. A). The blot hybridization pattern 
for each probe was completely consistent with a Mendelian 
pattern of inheritance. The results demonstrate that 

3 5 collections of probes for restriction endonuclease fragment 
polymorphisms may be obtained between two related individuals. 
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Each of the BamHI probes derived from the above experiment 
was also used in blot hybridizations to amplicons from the 
family and 10 other unrelated human DNAs extracted from cell 
lines or placentas (Table 2). Complete concordance between 
5 this method and Southern blotting of total genomic DNA was 
found. These results support the conclusion that the probes 
which detect polymorphisms within the Amish family will also 
detect polymorphisms in the human population at large. As 
indicated previously, these polymorphisms are referred to as 
10 PARFs (polymorphic amplifiable restriction endonuclease 
fragments) . 

The probes for PARFs are not equally abundant in the 
difference product. To obtain a measure of this unevenness, 
each cloned BarsHl PARF was hybridized to a grid of 90 
15 individually randomly-picked clones from the difference product 
of the two siblings, and its frequency in the collection, was 
determined (see percent value in Table 2) . From a total of 90 
randomly-picked elements, only 20 distinct polymorphic probes 
were present. 

20 it should be noted that the protocol was designed for the 
detection of a small number of differences between two nearly- 
identical genomes. where probes for polymorphic loci are 
deliberately sought, more representative difference products 
can be generated by diminishing the number of rounds of 

25 hybridization/amplification, increasing the complexity of the 
representation and/or decreasing the total number of PCR 
cycles. 

*** The following is an exemplary protocol used in the 
following examples, except where otherwise indicated. 



30 



DIFFEREN CE ANALYSTS PROTOCOL 



I. Preparation of amplicons 

1. Restriction of DNA. 

a. Digest io nq of Driver and Tester DNA with a 
restriction enzyme chosen for representation, taking- 10 U/yg 
3 5 of high molecular weight DNA. 
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b. Extract with equal volumes of phenol and 
phenol /chloroform. 

*c. Add NaOAc to final concentration 0.3 M, EtOH ppt. , 
wash with 70% EtOH, dry in vacuo and resuspend at 0.1 mg/ml. 
5 2. Purification of oligonucleotides 

a. Attach Sep-Paq cartridge (Waters, Millipore) to 
5 ml syringe and wash it with 10 ml of acetonitrile and 10 ml 
of water. 

b. Load 20 OD 260 of the oligonucleotide in 2 ml of 
10 water, wash with 10 ml of water and elute with 60% MeOH, 

collecting 7 fractions in Eppendorf tubes (3 drops per each 
tube) . 

c. Measure DNA concentration of 200 fold dilutions at 
X=260 nm, combine DNA containing fractions (approx. 500 Ml) and 

15 concentrate by liophylization up to 200-300 

d. EtOH ppt. (use 4 vol. of EtOH) after addition of 
1/10 vol. 3 M NaOAc , wash with 100% EtOH, dry, resuspend at 
62 pmol/Ml (12 OD 2fi0 /ml for 24-mers and 6 OD 260 /ml for 12-mers) . 

3. Ligation of adaptors 

20 a. Mix: 20 Ml (2 Mg) of Driver or Tester DNA digest, 

15 m1 of each 12-mer and 24-mer (primer set 

1) , 

4 Ml of ddH 2 0, 

6 m1 of 10 x Ligase buffer. 
25 *>. To anneal the oligonucleotides, place the tubes in 

a heating block (Termoline DriBath, holes filled with glycerol) 
at 50-55 °C and then place the block in a cold room for approx. 
1 h, until the temperature will decrease to 10-15°C. 

c. Place the tubes on ice for 3 min. , add 2 Ml 
30 (400 U/m1) of T4 DNA ligase, and incubate overnight at 12-16°C. 

4 . PGR 

a. Add 94 0 m1 of TE (10 mM Tris-HCl, pH 8.0/1 mM EDTA) 
plus tRNA (20 Mg/ml) buffer to each ligate to make a dilution. 

b. Makes 2 tubes of PCR mix for preparation of Tester 
35 amplicon and 10 tubes for preparation of Driver amplicon, each 

containing: 

80 Ml of 5 x PCR buffer (335 mM Tris-HCl, pH 8 . 8 
at 25°C, 20 mM MgCl,, 
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80 mM (NH 4 ),S0 4/ 50 mM /3-mercaptoethanol, 0.5 mg/ml 
of bovine serum albumin) 

32 yl of chase solution (4 mM of each dATP, dGTP, 

dCTP, dTTP) 

5 8 /il of 24-mer oligonucleotide (primer set l) 

240 Ml of ddH ; 0. 
c. Add 4 0 Ml of DNA ligate dilution (80 ng) in each 
tube and place the tubes in a Thermocycler (PerJcin Elmer Cetus) 
at 72«C. 

10 d - To fill-in 5 ' -protruding ends of the ligated 

adaptors, add 3 m1 (15 U) of AmpliTaq DNA polymerase in each 
tube (use Aerosol Barrier Pipet Tips) , mix, overlay with no m1 
of mineral oil and incubate for 5 min. 

e. Amplify for 20 cycles (1 min. at 95»C and 3 min. 
at 72°C) with the last cycle followed by extension at 72»C for 
10 min. 

5 . Rest r i ct i on o f amp 1 i cons 

a. Remove mineral oil, combine the contents of each 
of 2 PCR tubes in Eppendorf , extract with 600 m1 of phenol and 

20 phenol/chloroform. 

b. Add 1/10 vol. of 3 M NaOAc and equal volume of 
isopropanol, incubate for 15 min. in ice bath, spin, wash, dry. 
Resuspend Driver and Tester ampl icons in TE at concentration 
0.2-0.4 mg/ml (e>~ acting 10-20 Mg of DNA amplicon from one PCR 
tube), check DNA concentration using EtdBr solution (2 Mg/ml). 

c. Digest both Driver DNA (200 Mg) and Tester DNA 
(20 Mg) with initially choc an restriction endonuclease in order 
to cleave the adaptors, extract and iProOH ppt. as above. 

d. Resuspend Driver amplicon DNA digest in TE at 
30 approx. 1 mg/ml and Tester amplicon DNA digest at 

0.2-0.4 mg/ml. Measure Driver and Tester DNA concentrations 
by EtdBr fluorescence and agarose gel electrophoresis. Adjust 
Driver DNA concentration to 0.5 mg/ml and Tester DNA 
concentration to 0.1 mg/ml. 
3 5 6. Change of adaptors on Tester amplicon 

a. Load 10 Mg of Tester amplicon DNA digest on 2% 
NuSieve agarose gel (low melting point, FMC Bioproducts) . 



25 
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b. Cut agarose slice (0.2-0.4 g) containing fragments 
150-1500 bp in length and put it in a 5 ml Falcon tube. Add 
0.4 ml of 0,5 M MOPS pH 7.0, 0.4 ml 5 M NaCl and 3 ml of ddH : 0. 

c. Mix, melt at 72°C in a heating block for 10 min. , 
5 repeat this step one more time. 

d. Pass warm solution (30-50°C) through Qiagen-tip20 
(Qiagen Inc.), elute and precipitate DNA material as 
recommended by the supplier. Dissolve DNA pellet in 30 m! of 
TE buffer, check DNA concentration by EtdBr fluorescence, 

10 adjust to 0.1 mg/ml. 

e. Ligate 2 Mg of purified Tester DNA amplicon DNA 
digest to primer set 2, as described above, dilute with TE plus 
tRNA up to 10 Mg/ml (25 Mg/ml for Hind III representation). 

II. DNA hybridization/ amplification steps 

15 l. Hybridization 1. 

a. Mix 80 Ml of Driver amplicon DNA digest (0.5 mg/ml) 
and 40 Ml of diluted Tester amplicon ligate (0.4 Mg for 
representations made with most six cutters, 1 Mg for Hind III 
representation) , extract once with phenol/chloroform. 

20 b. Add 30 Ml of 10 M NH«OAc and 380 m! (2.5 vol.) of 

EtOH, chill at -70°C for 10 min. incubate at 37°C for 2 min., 
spin, wash twice with 70% EtOH, dry. 

c. Resuspend the pellet in 4 m1 of ~E x 3 buffer 
(30 mM EPPS from Sigma, pH 8.0 at 20°C, 3 mM EDTA) by vortexing 

25 for 2 min., spin the sample to the bottom and overlay with 
3 5 Ml of mineral oil. 

d. Denature DNA for 3-4 min. at 98 °C in a heating 
block, carefully add 1 m! of 5 M NaCl to the DNA drop and 
incubate at 67 °C for 20 h. 

30 2. Selective amplification 

a. Remove oil, add 8 m1 of tRNA solution (5 mg/ml), 
mix, add 390 m1 of TE buffer and mix again. 

b. To fill-in the adapter ends, make 2 tubes with 
360 ul of PCR mix (see above), not including 24-mer primer. 

35 Add 40 Ml or hybridized DNA dilution in each tube, place in 
Thermocycler at 72 °C, add 3 m1 of AmpliTaq DNA polymerase, mix, 
and incubate for 5 min. Add 10 m! of 24-mer primer (set 2), 
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mix, overlay with mineral oil and perforin 10 cycles of PCR as 
above. For J Bgl 24 primer lower annealing temperature (70°C) 
is required. 

c. Phenol and phenol/chloroform extract, iProOH ppt. 
5 as above, dissolve the pellet in each tube in 20 m! of ddH : 0, 

combine. 

d. Take 20 Ml of the amplified difference product 1, 
add 20 Ml of 2 x mung bean nuclease buffer and 2 Ml of mung 
bean nuclease (10 U/m1, NEB) , incubate at 30°C for 30 min. Add 

10 160 Ml of 50 mM Tris-HCl pH 8.9, inactivate the enzyme by 
5 min. incubation at 98 °C. Prepare 2 tubes with a PCR mix 
(360 Ml), containing J 24-mer primer, add 40 Ml of MBN-treated 
difference product in each tube and make PCR for 15 cycles as 
above . 

15 e. Run 10 m1 of . the amplificate on a 2% agarose gel, 

estimate the quantity of DMA (usually 0.1-0.3 Mg) and, if 
necessary to improve the yield, make 2-4 additional cycles 
after addition of 3 m1 of fresh AmpliTaq DNA polymerase. 

3. Change of adapter on a difference product 

20 a. Extract with phenol and phenol/chloroform, iProOH 

ppt. as above and dissolve the pellet at approx. 0.1 mg/ml. 
Determine DNA concentration by EtdBr fluorescence, adjust up 
to 0.1 mg/ml. 

b. Digest difference product with chosen restriction 
25 enzyme (10 U/Mg) , extract as above and EtOH ppt., wash, dry, 

dissolve at 20 ng/Ml. 

c. Take 10 Ml (200 ng) of DNA solution and directly 
ligate to adapter 3 (primer set 3) in a volume 60 m! as 
described above. Dilute the ligated difference product up to 

30 1.25 ng/Ml (2.5 ng/Ml for Hind III representation) with 100 Ml 
of TE buffer containing tRNA (20 m1 for Hind III). 

4. Subsequent hybridization/amplification steps 

a. For second hybridization mix 40 m1 (50 ng) of 
adapter ligated difference product (100 ng for Hind III 
35 representation) and 80 m1 (40 Mg) of Driver amplicon DNA 
digest. Proceed through hybridization/amplification step as 
above. 
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b. For third hybridization/amplification step take 
100 pg of difference product 2 ligated to the adapter 2 (400 pg 
for Hind III representation), making final amplification after 
MBN treatment for 20 cycles (25 for Hind III representation). 
5 c. For Hind III representation sometimes the fourth 

hybridization/amplification step is needed. Take 5 pg of 
difference product 3 ligated to adapter 3 with final 
amplification for 27 cycles, 

III. Cloning and analysis of difference products 
10 l. Cloning 

a. Take 10 Mg of the difference product after the last 
hybridization/amplification step, digest with chosen 
restriction enzyme, extract with phenol and phenol/chloroform, 
EtOH ppt. 

15 b. Dissolve obtained DNA in 100 m1 of TAE buffer and 

make 2% low melting point (LMP) gel electrophoresis and DNA 
purification as above. 

c. Dissolve digested difference product in 30 m1 of 
TE buffer, check the concentration and dilute an aliquot 

2 0 (2-5 jig) up to 10 ng/ml with tRKA containing TE buffer. 

d. To ligate the difference product in a plasmid 
vector mix: 

1 Ml of 10 x ligase buffer, 
6 Ml of ddH 2 0, 

25 1 Ml (10 ng) of gel-purified difference product DNA 

digest, 

i Ml (40 ng) of any pUC-derived vector, digested 
with chosen restriction enzyme and dephosphorylated, 

1 Ml (400 U) of T4 DNA ligase. 
30 Incubate for 1-3 h at 16°C and dilute by addition 

of 70 Ml of tRNA containing TE. 

e. Transform the competent DH 5a cells in a standard 
way. Plate on LB agar containing ampicillin, X-Gal, and IPTG. 

2. PCR amplification of cloned inserts 
35 a - Prepare PCR tubes each containing 100 m! of 

standard PCR mixture and sequencing and reverse sequencing 
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primers (seq. 24 and rev. 25, respectively, see Table) 
(500 pmol of each per tube) . 

b. Pick and transfer one white bacterial colony in 
each tube, vortex and place in Thermocycler at 95°C for 5 min. 
5 c. Lower the temperature by switching to 72 °C, add 

1 Ml (5 U) of AmpliTaq polymerase, mix, overlay with mineral 
oil and perform PCR for 30 cycles (1 min. at 95°C, 3 min. at 
72 °C) with final extension at 72 °C for 10 min. 

d. Analyze the yield and the size of the amplified 
10 fragments by 2% gel electrophoresis of 5 m1 aliquots. Purify 

chosen DNA fragments by Qiagen-tip20 chromatography, iProOH 
ppt. , wash, dry and dissolve in 30 /xl of TE. 

e. Determine DNA concentration by EtdBr fluorescence. 
For blot hybridizations dilute 1-2 jig of each fragment up to 

15 10 Mg/ml with tRNA containing TE buffer. 

Example 3. Application of rd a to isolating DNA probes that 

detect gene amplific ation in cancers. When tumor DNA 

was taken as tester and normal DNA from humans was taken as 
driver, RDA yielded difference products that hybridized to 

20 amplified sequences in the tumor DNA. This is an unanticipated 
result, th* probable consequence of the kinetic enrichment 
during RDA. Probes that detect amplified sequences in human 
cancers are of clinical value, since the presence of such 
sequences usually indicates a poor prognosis. For example, 

25 amplification of N-myc or the NEU oncogenes indicates poor 
prognosis for neuroblastoma or breast cancer, respectively. 

Difference products were found when DNA from a melanoma cell 
line or DNA from a small cell lung cancer cell line was used 
as tester and normal DNA from the individual donors, 

30 respectively, was used as driver. The difference products for 
the 1st, 2nd and 3rd round subtractions of the melanoma were 
subject to electrophoretic separation, and are shown in Figure 
1, right hand panel, lanes a, c and e. The difference products 
for the 1st, 2nd and 3rd rounds of subtractions of the lung 

3 5 cancer are shown in lanes b, d and f . size markers are in lane 
g, with lengths in basepairs indicated at right. The melanoma 
cell line was AH-Mel, and the small cell carcinoma cell line 
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was H1770. When some of the difference products were used as 
nucleic acid hybridization probes in genomic blots of 
restriction endonuclease cleaved human DNA from a variety of 
cancer cell lines, they detected sequences amplified in the 
5 small cell carcinoma cell line (top panel, left side of Figure 
1) or the melanoma cell line (middle and lower panel, left side 
of Figure 1) . The probes derived from the RDA analysis of the 
small cell carcinoma cell line also detect amplified sequences 
in a neuroblastoma cell line IMR-5 (top panel, left side) . The 
10 RDA probes were determined to map to human chromosome 2 (small 
cell lung carcinoma) and chromosome 3 (melanoma) by hybridizing 
them to a panel of *• ^chromosomal hybrid cells #2 obtained 
from NIGMS Human Genetic Mutant Cell Repository. No 
amplifications on chromosome 3 have been previously described. 
15 Next, was determined that driver DNA need not derive from the 
same individual as the tester. RDA was performed using DNA 
from the melanoma cell line as tester and using DNA from either 
the matched individual donor, an unmatched individual, or a 
pool of 10 unmatched individuals as driver. The same pattern 
Pf difference products was found whichever driver DNA was used 
(see Fig. 2). Thus tester and driver DNAs do not have to 
derive from the same individual when one is searching for 
probes that detect amplified DNA present in the tester. 
Example 4. The use of rda to d iscover w«w viruses. Human 
25 prostate cancer biopsies were analyzed using RDA. DNA 
extracted from a surgical biopsy of a prostate cancer was used 
as tester and DNA from normal tissue of the same individual was 
used as driver. A single difference product was obtained and 
sequenced. Computer analysis demonstrated that this difference 
sequence corresponded most closely to a rat LINE element, a 
member of repeated sequences found interspersed throughout the 
rat genome (see Fig. 3 for a sequence comparison). 
Oligonucleotide PCR primers derived from the extreme left hand 
and right hand sequences of this element were used to 
3 5 demonstrate its presence in various DNAs. Its presence was 
detected in rat DNA, and two different regions of the human 
prostate cancer, but not in the DNA from normal tissues of the 
human in which the cancer arose. Thus genetic information from 



20 
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rats has been found in human tissue, presumably through the 
agency of a virus. The DNA sequences of this presumed virus 
may be obtained by "chromosomal walking" from the inserted 
element. One may infer a causal role of this virus in the 
5 etiology of this cancer. 

Example 5. The use of RDA to isolate probes that detect 
genetic lesions in cancer. Using DNA from pure or nearly pure 
(>90%) cancer cells as tester and DNA from normal cells of the 
respective patient as driver many difference products were 

10 obtained. These difference products detected either loss-of- 
heterozygosity, hemizygous loss on chromosome Y, or homozygous 
loss in the tumor DNAs. The probes from RDA were mapped to 
human chromosomes. The results are summarized in Table 3. As 
tester, DNAs from four different renal cell carcinoma cell 

15 lines UOK114, UOK124, UOK132 and UOK112 were used, and one 
esophageal cancer biopsy, from patient #758. One probe, 
RCC124.1 (footnote d from Table 3) also detected homozygous 
loss on chromosome 2 in one additional renal cancer cell line 
and two bladder cancer cell lines. One probe, RCC132.12 

20 (footnote e from Table 3) also detected homozygous loss on 
chromosome 9 in two melanomas. One probe, BAR. 6 (footnote f 
from Table 3) also detects homozygous loss on chromosome 3 from 
several colon cancer cell lines* Probes that detect homozygous 
loss may be useful to define loci that encode tumor suppressor 

25 genes. Methods that detect loss of function of tumor 
suppressor genes may be useful in the clinical typing of 
cancers. 

Table 3: Application of RDA to the pairs of normal and tumor 
DNA's (tumor DNA as Driver). 





RDA fragments 




Experiment 


Selected for 

initial 
characterizat 
ion a 


Found to 
be 

informati 
ve b 


Chromosome 
s 

affected 0 


1. Renal cell 
carcinoma, cell 
line UOK114 
(male) 


12 


4(1/3/0) 


3/3,3,10 
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2. Renal cell 
carcinoma, cell 
line UOK124 
(female) 


11 


5(2/3/0) 


2 d /ND 


5 


3. Renal cell 
carcinoma, cell 
line UOK132 
(male) 


10 


9(0/3/6) 


-/9 C ,9,5 


10 


4. Renal cell 
carcinoma, cell 
line UOK112 
(male) 


13 


13(0/0/13 
) 


-/- 


15 


5, Barrett's 
esophageal 
cancer, patient 
#758, sorted 
nuclei (male) 


5 


5(1/0/4) 


3 f /- 




Total 


38 


23 

(4/9/10) 





a. Clones with distinct insert sizes. 

20 b. Entries in parentheses (x/y/z) show distribution of 
fragments according to type of loss, where x is number of 
probes detecting homozygous loss, y the number detecting loss 
of heterozygosity, and z the number detecting hemizygous loss 
from the Y chromosome. 

25 c. Chromoiomal location of probes, where x/... are the 

locations of probes detecting homozygous loss, and /x the 

locations of probes detecting loss of heterozygosity. ND means 
not yet determined. 

d. Probe RCC124.1 also detects homozygous loss in bladder 
30 cancer cell lines. 

e. One probe, RCC132.12, detected homozygous loss on 
chromosome 9 in melanomas. 

f. Probe BAR. 6 also detects homozygous loss in four out of 
seven colon cancer cell lines and one bladder carcinoma cell 

35 line. 



Example 6. The appl ication of RDA to the analysis of DNA from 
pool3 of individual. RDA may be applied to the discovery of 
4 0 polymorphisms that are genetically linked to an inherited trait 
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such as a disease susceptibility or a behavioral abnormality 
in humans • To utilize RDA for this purpose, it is desirable 
to use pools of DNAs from a group of individuals for use as 
either tester, driver or both. When used this way, RDA may 
5 yield probes that detect polymorphic alleles that are present 
in one group and not in another. In particular, when such 
pools are used as driver, RDA yields probes for restriction 
endonuclease polymorphisms (PARFs) that distinguish tester from 
all individuals in the driver pool. When pools are used as 
10 tester, RDA yields PARFs that distinguish at least one member 
of the tester pool from the driver individual. In the most 
challenging example, when both tester and driver are pooled 
DNAs from groups of individuals, RDA yields PARFs that 
distinguish at least one member of the tester group from all 
15 members of the driver group. 

This is illustrated, in Table 4. Two groups of humans were 
taken: ten that shared a genetic abnormality, neuronal ceroid 
lipo-fuscinosis, also known as Batten's disease, and ten that 
did not have this condition. DNAs were prepared from cells of 
20 each individual and pooled accordingly. Pools of DNA were used 
for RDA using DNA from one group as tester and DNA from the 
other as driver, and then reversing the procedure. In each 
case difference products were obtained that detected PARFs. 
In Table 4 the probe name is listed, and "+ w indicates that it 
25 detected the small allele of the PARF in a given individual. 
As the Table shows, when normal individuals were used as 
tester, probes (pAl, pA2, pA4, and pA9) were obtained that 
detected small PARF alleles in at least one member of the 
group, and this allele was always absent in the individuals 
30 with Batten's disease. Similarly, when DNAs from the affected 
group was used as tester, probes (pN2, pN7, pN9, pN13 and pN15) 
were obtained that detected small PARF alleles in at least one 
member of the affected group, and this allele was always absent 
in the normal group. 

35 Table 4: Screening for presence of Bgl II PARF's in 20 human 
DNA amplicons 



Length of 
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10 



/i£ L cCUcQS 




Normals 




small 


riODc 1 £ J 4 3 O / O y 


1 0 


123456789 


lOallele (bp) 


pAl * 




+ + 




300 


pA2 




+ + + + 


+ 


120 


pA4 








150 


pA9 




+ + 




400 


pN2 


+ 






425 


pN7 + + + + 


+ 






300 


pN9 








350 


pN13 + + + 








400 


pN15 


+ 






600 



20 



30 



Example 7. The use of RDA in obtaining probes that reflect 
differences in RNA populations. RDA can be applied to compare 
populations of double stranded cDNAs derived from RNA. The 
15 difference products will yield probes that detect sequences 
expressed among the RNA from one source that are not 
equivalently expressed in another. Such probes are sometimes 
of use in diagnosis (e.g. to determine the origin of a cell, 
or to find evidence of infection) and can lead to the discovery 
of important tissue-specific or disease related genes. 

A double stranded cDNA population was prepared from RNA 
extracted from a male mouse brain. This was used as driver. 
A one hundred thousandth part of double stranded DNA from the 
kanamycin resistance gene encoded by an E. coli plasmid was 
25 added to a small portion of this cDNA, and this used as tester. 
This model system mimics the case of a single small difference 
between the expressed RNAs from two sources. RDA was performed 
on these two samples using the enzyme Sau3A to prepare the 
respective amplicons. The difference product after two rounds 
of substraction was separated using gel electrophoresis, as 
shown in Fig. 4. In the left hand lane is shown an 
electrophoretic separation of amplicons prepared from 1.2 Jcb 
of the kanamycin gene, in the middle lane were size markers. 
The difference product from the RDA is seen in the right hand 
3 5 lane. This product was derived from the kanamycin gene as 
shown by blot hybridization, thus proving that RDA can be used 
to detect differences in DMAs derived from RNA populations. 

It is evident from the above results, that a powerful 
tool has been provided for isolating probes which can be used 
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to identify sequence differences between two related genomes. 
This technique may be used in a wide variety of contexts in 
relation to forensic medicine, detecting the presence of 
pathogenic DNA, lesions occurring in neoplastic cells, genetic 
5 counseling, the presence of genes associated with genetic 
diseases, and the like. 

All publications and patent applications cited in this 
specification are herein incorporated by reference as if each 
individual publication or patent application were specifically 
10 and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in 
some detail by way illustration and example for purposes of 
clarity of understanding, it will be readily apparent to those 
of ordinary skill in the art in light of the teachings of this 
15 invention that certain changes and modifications may be made 
thereto without departing from the spirit or scope of the 
appended claims. 
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WHAT IS CLAIMED IS: 

1. A method for producing probes capable of 
distinguishing at least one sequence difference between DNA 
from two related different eukaryotic sources, said method 
5 comprising: 

substantially completely digesting separately the DNA 
from said two different sources with a restriction endonuclease 
to provide digested fragments , wherein one of said sources is 
driver DNA, and the other source is tester DNA, wherein said 
10 tester DNA comprises target DNA, wherein said target DNA 
comprises sequence differences between the DNA of said two 
sources; 

ligating a first set of adaptors to said digested 
fragments and amplifying said fragments using primers to one 
15 of the strands of said first set adaptors, to provide amplified 
amounts of fragments of said digested sequences of less than 
about 2k bp as ampl icons; 

carrying out a first round of the following steps for 
enrichment of target DNA: 
20 removing said first set of adaptors from said ampl icons 

and ligating a second set of adaptors to the 5' ends of the 
amplicons of tester DNA; 

combining under melting and annealing conditions said 
tester amplicons with a large excess of driven amplicons, 
25 whereby a portion of the resulting dsDNA comprises self- 
annealed tester DNA including target DNA; 

filling in the 3' ends of annealed DNA; 

amplifying said dsDNA with primers complementary to one 
of said strands of said second set of adaptors to enrich for 
3 0 target DNA; 

optionally repeating said first round of steps as a 
second round or successive round, to provide DNA sequences 
which serve to identify differences in DNA sequences between 
said tester source and said driver source. 

3 5 2. A method according to Claim 1, including the 

additional step after said filling in of digesting single 
stranded DNA with a nuclease. 
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3. A method according to Claim 1, wherein said first 
round of steps is repeated at least once. 

4. A method according to Claim 3, wherein different 
sets of adaptors are used for at least the first three rounds. 

5 5. A method according to Claim 1, wherein said 

digesting is with a restriction endonuclease which has a 
recognition sequence of at least 6 nucleotides and provides a 
staggered cleavage. 

6. A method according to Claim 1, wherein the 
10 sources of DNA are cells from related human individuals or the 

same individual. 

7. A method according to Claim 1, wherein said DNA 
from said two related sources is cDNA. 

8. A method according to Claim 1, wherein said DNA 
15 from at least one of said two related sources is DNA pooled 

from a plurality of individual related sources. 

9* A method for producing probes capable of 
distinguishing at least one sequence difference between genomes 
from two related cellular sources, said method comprising: 

2 0 substantially completely digesting separately the DNA 

from said two different sources with a restriction endonuclease 
having a nucleotide recognition sequence of at least 4 
nucleotides, wherein one of said sources is driver DNA, and the 
other source is tester DNA, wherein said tester DNA comprises 
25 target DNA, wherein said target DNA comprises sequence 
differences between the genomes of said two sources; 

ligating a first set of adaptors to said digested 
fragments and amplifying said fragments using primers to one 
of the strands of said first set adaptors to provide amplified 

3 0 amounts of fragments of said digested sequences of less than 

about 2kbp as amplicons; 
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carrying out a first round of the following steps for 
enrichment of target DNA: 

removing said first set of adaptors from said amplicons 
and ligating a second set of adaptors to the 5' end of 
5 amplicons of tester DNA; 

combining under melting and annealing conditions said 
tester amplicons with a large excess of driver amplicons , 
whereby a portion of the resulting dsONA comprises self- 
annealed tester DNA includi g target DNA; 
10 filling in 3' overhangs; 

amplifying said dsDNA with primers to one of said 
strands of said second set of adaptors to enrich for target 
DNA; 

repeating said first round of steps for at least 2 
15 rounds,, using a different set of adaptors in each successive 
round for said 2 rounds to provide a DNA composition comprising 
a predominant amount of target DNA; 

cloning said DNA composition to provide clones having 
a substantially homogeneous probe of putative target DNA; 
20 with the proviso that when a plurality of probes of 

putative target DNA are obtained, optionally including the 
additional step of: 

hybridizing said probes of putative target DNA with 
dr: /er and tester amplicons, whereby probes of putative target 

2 5 DNA binding to both driver and tester amplicons are discarded. 

10. A method according to Claim 9, wherein said 
related .Voman cellular sources are from the same individual and 
differ as to the suspected presence of a pathogen. 

11. A method according to Claim 9, wherein said 

3 0 related human cellular sources are from the same individual and 

differ as to the suspected presence of a genetic lesion. 

12. A method according to Claim 9, wherein said 
related human cellular sources are from different individuals. 
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13. A method for producing probes capable of 
distinguishing at least one sequence difference between genomes 
from a neoplastic cell source and a related normal cell source, 
said method comprising: 
5 substantially completely digesting separately the DNA 

from said two sources with a restriction endonuclease having 
a nucleotide recognition sequence of at least 4 nucleotides, 
wherein said normal cell source is driver DNA, and said 
neoplastic cell source is tester DNA, wherein said tester DNA 
10 comprises target DNA, wherein said target DNA comprises 
sequence differences between the genomes of said two sources 
comprising at least one of an insertion, deletion, 
rearrangement or DNA amplification defining target DNA; 

ligating a first set of adaptors to said digested 
15 fragments and amplifying said fragments using primers to one 
of the strands of said first set of adaptors to provide 
amplified amounts of fragments of said digested sequences of 
less than about 2k bp as amplicons; 

carrying out a first round of the following steps for 
20 enrichment of target DNA: 

removing said first set of adaptors from said amplicons 
and ligating a second set of adaptors to 5' ends cf amplicons 
of tester DNA; 

combining under melting and annealing conditions said 

2 5 tester amplicons with a large excess of driver amplicons, 

whereby a portion of the resulting dsDNA comprises self- 
annealed tester DNA including target DNA; 

filling in the 3' ends of overhangs; 

amplifying said dsDNA with primers to one of said 

3 0 strands of said second set of adaptors to enrich for target 

DNA; 

repeating said first round of steps for at least 1 
additional round, using a different set of adaptors as to the 
previous round in each successive round to provide a DNA 
3 5 composition comprising a predominant amount of target DNA; and 

cloning said DNA composition to provide clones having 
a substantially homogeneous probe of target DNA. 
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14. A method for producing probes capable of 
distinguishing at least one sequence difference between genomes 
from a neoplastic cell source and a related normal cell source, 
said method comprising: 
5 substantially completely digesting separately the DNA 

from said two sources with a restriction endonuclease having 
a nucleotide recognition sequence of at least 4 nucleotides, 
wherein said neolastic cell source is driver DNA, and said 
normal cell source is tester DNA, wherein said tester DNA 
10 comprises target DNA, wherein said target DNA comprises 
sequence differences between the genomes of said two sources 
comprising loss of heterozygosity, homozygosity or hemizygous 
loss to define target DNA; 

ligating a first set of adaptors to said digested 
15 fragments and amplifying said fragments using primers to one 
of the strands of said first set of adaptors to provide 
amplified amounts of fragments of said digested sequences of 
less than about 2kbp as amplicons; 

carrying out a first round of the following steps for 
20 enrichment of target DNA: 

removing said first set of adaptors from said amplicons 
and ligating a second set of adaptors to 5' ends of amplicons 
of tester DNA; 

combining under melting and annealing conditions said 
25 tester amplicons with a large excess of driver amplicons, 
whereby a portion of the resulting dsDNA comprises self- 
annealed tester DNA including target DNA; 

filling in the 3' ends of overhangs; 

amplifying said dsDNA with primers to one of said 
3 0 strands of said second set of adaptors to enrich for target 
DNA; 

repeating said first round of steps for at least 1 
round, using a different set of adaptors as to the previous 
round in each successive round to provide a DNA composition 
3 5 comprising a predominant amount of target DNA; and 

cloning said DNA composition to provide clones having 
a substantially homogeneous probe of target DNA. 
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15. A kit comprising at least two probes prepared 
according to the method according to Claim l. 

16. A kit comprising at least two probes prepared 
according to the method according to Claim 9. 

17. A kit comprising at least two probes prepared 
according to the method according to Claim 13. 

18. A kit comprising at least two probes prepared 
according to the method according to Claim 14. 
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